Xxviii. Speech Communication Academic and Research Staff A. Real-time Spectral Input System for Computer Analysis of Speech
نویسندگان
چکیده
The objective of the research in speech communication is to gain an understanding of the processes whereby (a) discrete linguistic entities are encoded into speech by human talkers, and (b) speech signals are decoded into meaningful linguistic units by human listeners. Our general approach is to formulate theories or hypotheses regarding certain aspects of the speech processes, obtain experimental data to verify these hypotheses, and simulate models of the processes and compare the performances of the models and of human talkers or listeners. Research in progress or recently completed includes: observations of the acoustic and articulatory aspects of speech production in English and in other languages through spectrographic analysis; study of cineradiographic data and measurement of air-flow events; study of the perception of speech sounds by children and examination of the acoustic properties of the utterances of children; computer simulation of articulatory movements in speech; investigation of the mechanism of larynx operation through computer modeling and acoustic analysis; examination of new procedures for analysis of speech signals using deconvolution techniques; experimental studies of the perception of vowel sounds; speech synthesis by rule with a computer-simulated terminal analog synthesizer; a re-examination of the system of features used to describe the phonetic segments of language; and the development and improvement of interface equipment for spectral analysis of speech with a computer and for synthesis of speech from computer-generated control signals. K. N. Stevens, M. Halle
منابع مشابه
Classification of emotional speech using spectral pattern features
Speech Emotion Recognition (SER) is a new and challenging research area with a wide range of applications in man-machine interactions. The aim of a SER system is to recognize human emotion by analyzing the acoustics of speech sound. In this study, we propose Spectral Pattern features (SPs) and Harmonic Energy features (HEs) for emotion recognition. These features extracted from the spectrogram ...
متن کاملA Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملA New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)
Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...
متن کاملSpeech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering
Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...
متن کاملAn Information-Theoretic Discussion of Convolutional Bottleneck Features for Robust Speech Recognition
Convolutional Neural Networks (CNNs) have been shown their performance in speech recognition systems for extracting features, and also acoustic modeling. In addition, CNNs have been used for robust speech recognition and competitive results have been reported. Convolutive Bottleneck Network (CBN) is a kind of CNNs which has a bottleneck layer among its fully connected layers. The bottleneck fea...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009